🐿️ Scour
🚀 CUDA Kernels
GPU Programming, Memory Optimization, Parallel Computing, Performance Tuning
Show HN: We Built a Serverless GPU Platform with Fast Cold Starts
dat1.co
·
1d
·
Discuss:
Hacker News
🔱
Triton
Custom CUDA kernels for small-batch ML on GTX 1650: Memory hierarchy optimization and vectorization techniques
reddit.com
·
6d
·
Discuss:
r/programming
🔱
Triton
ik_llama.cpp and Qwen 3 30B-A3B architecture.
reddit.com
·
1d
·
Discuss:
r/LocalLLaMA
🔱
Triton
Deriving RoPE the Proper Way
nor-blog.pages.dev
·
17h
·
Discuss:
Hacker News
🧮
Mathematics
Skimpy HBM Memory Opens Up The Way For AI Inference Memory Godbox
nextplatform.com
·
2d
·
Discuss:
Hacker News
🔧
Hardware
A lightweight library for portable low-level GPU computation using WebGPU
github.com
·
6d
·
Discuss:
Hacker News
🔱
Triton
Linus Torvalds still uses an AMD RX 580 from 2017 — also ditches Apple Silicon for an Intel laptop
tomshardware.com
·
23h
·
Discuss:
Hacker News
🔧
Hardware
When Your Database Lives in CPU Cache (Because Why Not?)
blog.canoozie.net
·
11h
·
Discuss:
Hacker News
🦀
Rust
Step3
stepfun.ai
·
18h
·
Discuss:
Hacker News
📱
Edge AI
AMD Ryzen Threadripper 9980X and 9970X Review: Zen 5 Powers Gains
storagereview.com
·
1d
·
Discuss:
Hacker News
🔧
Hardware
Kaizen (YC X25) Is Hiring Engineers to Build Browser Agents That Work
ycombinator.com
·
21h
·
Discuss:
Hacker News
📱
Edge AI
Introduction to Unikernel: Building, Deploying Lightweight, Secure Applications
tallysolutions.com
·
37m
·
Discuss:
Hacker News
🦀
Rust
Releasing open weights for FLUX.1 Krea
krea.ai
·
1d
·
Discuss:
Hacker News
,
r/LocalLLaMA
👁️
Computer vision
MLCommons Releases MLPerf Client v1.0
mlcommons.org
·
7h
·
Discuss:
Hacker News
📱
Edge AI
Isle FPGA Computer
projectf.io
·
4h
·
Discuss:
Hacker News
🔧
Hardware
I spearheaded the development of an AI app that acts as a VFX Supervisor for Filmmakers and VFX Artists. This is what I learned in the process.... and this is o...
youtube.com
·
6h
·
Discuss:
r/programming
🎨
Neural Rendering
Kimi K2 vs Grok 4: Who’s Better at Real-World Coding Tasks with Tools?
forgecode.dev
·
8h
·
Discuss:
r/LocalLLaMA
🦀
Rust
Sub-millisecond GPU Task Queue: Optimized CUDA Kernels for Small-Batch ML Inference on GTX 1650
reddit.com
·
6d
·
Discuss:
r/programming
🔱
Triton
Small Models, Big Wins: Agentic AI in Enterprise Explained
blog.premai.io
·
2h
·
Discuss:
Hacker News
📱
Edge AI
Maybe the Fastest Disk Usage Program on macOS
healeycodes.com
·
1d
·
Discuss:
Hacker News
,
r/programming
🦀
Rust